Smart Paper Notes

A Survey on Influence Maximization: From an ML-Based Combinatorial Optimization

Yandi Li, Haobo Gao, Yunxuan Gao, Jianxiong Guo, Weili Wu

Category: Machine Learning

2022-11-06

Influence Maximization (IM) is a classical combinatorial optimization problem with wide applications in mobile networks, social computing, and recommendation systems. It aims to select a small number of users so as to maximize the influence spread across an online social network. Because of its potential commercial and academic value, many researchers have studied the IM problem from different perspectives. The main challenge comes from the NP-hardness of the IM problem and the #P-hardness of estimating the influence spread; traditional algorithms for overcoming them fall into two classes: heuristic algorithms and approximation algorithms. However, heuristic algorithms offer no theoretical guarantee, and the theoretical design of approximation algorithms is close to its limit, so it is almost impossible to further optimize and improve their performance. With the rapid development of artificial intelligence, Machine Learning (ML) has achieved remarkable results in many fields. Accordingly, a number of new methods have emerged in recent years that solve combinatorial optimization problems with ML-based techniques. These methods offer fast solving speed and strong generalization to unseen graphs, which opens a new direction for solving combinatorial optimization problems. Therefore, we set aside the traditional algorithms based on iterative search and review the recent development of ML-based methods, especially Deep Reinforcement Learning, for solving the IM problem and its variants in social networks. We focus on summarizing the relevant background knowledge, basic principles, common methods, and applied research. Finally, we point out the challenges that urgently need to be solved in future IM research.

translated by Google Translate
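
To make the problem statement in the abstract above concrete, here is a minimal sketch of the classical greedy baseline that the survey contrasts with ML-based methods. It assumes the independent cascade (IC) diffusion model with a single uniform activation probability and estimates the spread by Monte Carlo simulation; the function names, the adjacency-dict graph format, and the default parameters are illustrative assumptions, not anything taken from the paper.

```python
import random


def simulate_ic(graph, seeds, prob, rng):
    """Run one independent-cascade simulation and return the number of active nodes.

    graph: dict mapping a node to the list of its out-neighbours (assumed format).
    prob:  uniform activation probability per edge (an assumption for this sketch).
    """
    active = set(seeds)
    frontier = list(seeds)
    while frontier:
        next_frontier = []
        for u in frontier:
            for v in graph.get(u, []):
                # Each newly active node gets one chance to activate each neighbour.
                if v not in active and rng.random() < prob:
                    active.add(v)
                    next_frontier.append(v)
        frontier = next_frontier
    return len(active)


def estimate_spread(graph, seeds, prob, runs, rng):
    """Monte Carlo estimate of the expected influence spread of a seed set."""
    return sum(simulate_ic(graph, seeds, prob, rng) for _ in range(runs)) / runs


def greedy_im(graph, k, prob=0.1, runs=200, seed=0):
    """Greedily pick up to k seeds, each time adding the node with the best estimated spread."""
    rng = random.Random(seed)
    nodes = set(graph) | {v for targets in graph.values() for v in targets}
    chosen = []
    for _ in range(min(k, len(nodes))):
        best_node, best_spread = None, -1.0
        for candidate in sorted(nodes - set(chosen)):
            spread = estimate_spread(graph, chosen + [candidate], prob, runs, rng)
            if spread > best_spread:
                best_node, best_spread = candidate, spread
        chosen.append(best_node)
    return chosen
```

Every greedy step re-simulates the cascade for every remaining candidate, so the cost grows with the number of nodes, the seed budget, and the number of simulation runs. That repeated estimation is the kind of iterative search the abstract refers to when it motivates learned alternatives.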

Longitudinal Prediction of Postnatal Brain Magnetic Resonance Images via a Metamorphic Generative Adversarial Network

Yunzhi Huang, Sahar Ahmad, Luyi Han, Shuai Wang, Zhengwang Wu, Weili Lin, Gang Li, Li Wang, Pew-Thian Yap

Category: Computer Vision

2022-08-09

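Missing scans are inevitable in longitudinal studies because of subject dropout or failed scans. In this paper, we propose a deep learning framework that predicts missing scans from acquired scans, catering to longitudinal infant studies. Predicting infant brain MRI is challenging owing to the rapid contrast and structural changes, particularly during the first year of life. We introduce a trustworthy metamorphic generative adversarial network (MGAN) for translating infant brain MRI from one time point to another. MGAN has three key features: (i) image translation that leverages spatial and frequency information for detail-preserving mapping; (ii) a quality-guided learning strategy that focuses attention on challenging regions; and (iii) a multi-scale hybrid loss function that improves the translation of tissue contrast and structural details. Experimental results indicate that MGAN outperforms existing GANs by accurately predicting both contrast and anatomical details.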


Graph Representation Learning for Popularity Prediction Problem: A Survey

Tiantian Chen, Jianxiong Guo, Weili Wu

Category: Machine Learning

2022-03-15

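Online social platforms such as Twitter, Facebook, LinkedIn, and WeChat have grown very fast over the last decade and have become one of the most effective channels for people to communicate and share information with each other. Because of the "word of mouth" effect, information usually spreads rapidly on these social media platforms. It is therefore important to study the mechanisms that drive information diffusion and to quantify the consequences of information spread. Many efforts have focused on this problem to help us understand it better and achieve higher performance in viral marketing and advertising. On the other hand, the development of neural networks has flourished in the last few years, leading to a large number of graph representation learning (GRL) models. Compared with traditional models, GRL methods are often shown to be more effective. In this paper, we present a comprehensive review of existing works that use GRL methods for the popularity prediction problem, and we categorize the related literature into two big classes according to the main models and techniques used: embedding-based methods and deep learning methods. Deep learning methods are further divided into six small classes: convolutional neural networks, graph convolutional networks, graph attention networks, graph neural networks, recurrent neural networks, and reinforcement learning. We compare the performance of these different models and discuss their strengths and limitations. Finally, we outline the challenges and future opportunities for the popularity prediction problem.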


Source-Free Unsupervised Domain Adaptation: A Survey

Yuqi Fang, Pew-Thian Yap, Weili Lin, Hongtu Zhu, Mingxia Liu

Category: Computer Vision | Artificial Intelligence | Machine Learning

2022-12-31

Unsupervised domain adaptation (UDA) via deep learning has attracted considerable attention for tackling domain-shift problems caused by distribution discrepancy across different domains. Existing UDA approaches depend heavily on access to source domain data, which is usually limited in practical scenarios due to privacy protection, data storage and transmission cost, and computation burden. To tackle this issue, many source-free unsupervised domain adaptation (SFUDA) methods have been proposed recently, which transfer knowledge from a pre-trained source model to an unlabeled target domain while the source data remain inaccessible. A comprehensive review of these works on SFUDA is of great significance. In this paper, we provide a timely and systematic literature review of existing SFUDA approaches from a technical perspective. Specifically, we categorize current SFUDA studies into two groups, i.e., white-box SFUDA and black-box SFUDA, and further divide them into finer subcategories based on the learning strategies they use. We also investigate the challenges of methods in each subcategory, discuss the advantages and disadvantages of white-box and black-box SFUDA methods, summarize the commonly used benchmark datasets, and summarize the popular techniques for improving the generalizability of models learned without using source data. We finally discuss several promising future directions in this field.


Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing

Shengchao Liu, Weili Nie, Chengpeng Wang, Jiarui Lu, Zhuoran Qiao, Ling Liu, Jian Tang, Chaowei Xiao, Anima Anandkumar

Category: Machine Learning | Natural Language Processing | Machine Learning (Statistics)

2022-12-21

There is increasing adoption of artificial intelligence in drug discovery. However, existing works use machine learning mainly to exploit the chemical structures of molecules and ignore the vast textual knowledge available in chemistry. Incorporating textual knowledge enables us to realize new drug design objectives, adapt to text-based instructions, and predict complex biological activities. We present a multi-modal molecule structure-text model, MoleculeSTM, that jointly learns molecules' chemical structures and textual descriptions via a contrastive learning strategy. To train MoleculeSTM, we construct the largest multi-modal dataset to date, namely PubChemSTM, with over 280K chemical structure-text pairs. To demonstrate the effectiveness and utility of MoleculeSTM, we design two challenging zero-shot tasks based on text instructions: structure-text retrieval and molecule editing. MoleculeSTM possesses two main properties: open vocabulary and compositionality via natural language. In experiments, MoleculeSTM obtains state-of-the-art generalization ability to novel biochemical concepts across various benchmarks.


Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Manli Shu, Weili Nie, De-An Huang, Zhiding Yu, Tom Goldstein, Anima Anandkumar, Chaowei Xiao

Category: Computer Vision

2022-09-15

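Pre-trained vision-language models (e.g., CLIP) have shown promising zero-shot generalization in many downstream tasks when given properly designed text prompts. Instead of relying on hand-engineered prompts, recent works learn prompts using training data from the downstream tasks. While effective, training on domain-specific data reduces a model's ability to generalize to unseen new domains. In this work, we propose test-time prompt tuning (TPT), a method that learns adaptive prompts on the fly from a single test sample. For image classification, TPT optimizes the prompt by minimizing the entropy with confidence selection, so that the model makes consistent predictions across different augmented views of each test sample. When evaluating generalization to natural distribution shifts, TPT improves the zero-shot top-1 accuracy by 3.6% on average, surpassing previous prompt tuning approaches that require additional task-specific training data. When evaluating cross-dataset generalization with unseen categories, TPT performs on par with state-of-the-art approaches that use additional training data. Project page: https://azshue.github.io/tpt.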
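
The objective described in this abstract, entropy minimization with confidence selection over augmented views, can be sketched as follows. This is a minimal illustration of the loss value only, under stated assumptions: the logits of each augmented view are taken as given, the `keep_fraction` parameter and all function names are hypothetical, and the gradient step that would update the prompt is omitted.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy (in nats) of a probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def confident_marginal_entropy(view_logits, keep_fraction=0.5):
    """Entropy of the prediction averaged over the most confident augmented views.

    view_logits:   one list of class logits per augmented view of a single test sample.
    keep_fraction: share of views kept after ranking by entropy (hypothetical default).
    Returns (loss, marginal), where marginal is the averaged class distribution.
    """
    probs = [softmax(logits) for logits in view_logits]
    # Confidence selection: low-entropy views are treated as confident.
    ranked = sorted(probs, key=entropy)
    n_keep = max(1, int(len(ranked) * keep_fraction))
    kept = ranked[:n_keep]
    n_classes = len(kept[0])
    marginal = [sum(p[c] for p in kept) / n_keep for c in range(n_classes)]
    return entropy(marginal), marginal
```

In a full test-time tuning loop this scalar would be minimized with respect to the prompt parameters for each test sample. Discarding high-entropy views keeps uninformative augmentations from dominating the averaged prediction.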


Retrieval-based Controllable Molecule Generation

Zichao Wang, Weili Nie, Zhuoran Qiao, Chaowei Xiao, Richard Baraniuk, Anima Anandkumar

Category: Machine Learning

2022-08-23

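Generating new molecules with specified chemical and biological properties via generative models has emerged as a promising direction for drug discovery. However, existing methods require extensive training or fine-tuning with a large dataset, which is often unavailable in real-world settings. In this work, we propose a new retrieval-based framework for controllable molecule generation. We use a set of exemplar molecules, i.e., molecules that (partially) satisfy the design criteria, to steer a pre-trained generative model towards synthesizing molecules that satisfy the given design criteria. We design a retrieval mechanism that fuses the exemplar molecules with the input molecule and is trained with a new self-supervised objective that predicts the nearest neighbor of the input molecule. We also propose an iterative refinement process that dynamically updates the generated molecules and the retrieval database for better generalization. Our approach is agnostic to the choice of generative model and requires no task-specific fine-tuning. On various tasks ranging from simple design criteria to a challenging real-world scenario of designing lead compounds that bind to the SARS-CoV-2 main protease, we demonstrate that our approach extrapolates well beyond the retrieval database and achieves better performance and wider applicability than previous methods.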


PointDP: Diffusion-driven Purification against Adversarial Attacks on 3D Point Cloud Recognition

Jiachen Sun, Weili Nie, Zhiding Yu, Z. Morley Mao, Chaowei Xiao

Category: Computer Vision | Machine Learning

2022-08-21

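3D point clouds are becoming a critical data representation in many real-world applications such as autonomous driving, robotics, and medical imaging. Although the success of deep learning has further accelerated the adoption of 3D point clouds in the physical world, deep learning is notorious for its vulnerability to adversarial attacks. In this work, we first identify that the state-of-the-art empirical defense, adversarial training, has a major limitation when applied to 3D point cloud models because of gradient obfuscation. We further propose PointDP, a purification strategy that leverages diffusion models to defend against 3D adversarial attacks. We extensively evaluate PointDP on six representative 3D point cloud architectures and use 10+ strong and adaptive attacks to demonstrate its lower-bound robustness. Our evaluation shows that PointDP achieves significantly better robustness than state-of-the-art purification methods under strong attacks. Certified-defense results for randomized smoothing combined with PointDP will be included in the near future.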


Private, Efficient, and Accurate: Protecting Models Trained by Multi-party Learning with Differential Privacy

Wenqiang Ruan, Mingxin Xu, Wenjing Fang, Li Wang, Lei Wang, Weili Han

Category: Machine Learning

2022-08-18

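Machine learning based on secure multi-party computation, referred to as MPL, has become an important technology for utilizing data from multiple parties while preserving privacy. Although MPL provides rigorous security guarantees for the computation process, the models trained by MPL are still vulnerable to attacks that depend solely on access to the models. Differential privacy can help defend against such attacks. However, the accuracy loss brought by differential privacy and the huge communication overhead of secure multi-party computation protocols make it highly challenging to balance the three-way trade-off between privacy, efficiency, and accuracy. In this paper, we are motivated to resolve this issue by proposing a solution, referred to as PEA (Private, Efficient, Accurate), which consists of a secure DPSGD protocol and two optimization methods. First, we propose a secure DPSGD protocol to enforce DPSGD in secret sharing-based MPL frameworks. Second, to reduce the accuracy loss caused by differential privacy noise and the huge communication overhead of MPL, we propose two optimization methods for the MPL training process: (1) a data-independent feature extraction method, which aims to simplify the structure of the trained model; and (2) a local data-based global model initialization method, which aims to speed up the convergence of model training. We implement PEA in two open-source MPL frameworks: TF-Encrypted and Queqiao. Experimental results on various datasets demonstrate the efficiency and effectiveness of PEA. For example, when $\epsilon = 2$, we can train a differentially private classification model for CIFAR-10 with an accuracy of 88% within 7 minutes under the LAN setting. This result significantly outperforms the one from CryptGPU, a SOTA MPL framework, which takes more than 16 hours to train a non-private deep neural network model on CIFAR-10 with the same accuracy.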
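
For readers unfamiliar with DPSGD, the update that a secure protocol such as the one in this abstract would have to enforce can be sketched in plaintext as per-example gradient clipping followed by Gaussian noise. This is a minimal single-machine illustration of standard DP-SGD, not the paper's secret-sharing protocol; the function names and parameters (`clip_norm`, `noise_multiplier`) are assumptions chosen for the example.

```python
import math
import random


def clip_gradient(grad, clip_norm):
    """Scale a per-example gradient so that its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, clip_norm / norm) if norm > 0.0 else 1.0
    return [g * scale for g in grad]


def dp_sgd_step(weights, per_example_grads, lr, clip_norm, noise_multiplier, rng):
    """One plaintext DP-SGD update on a batch of per-example gradients.

    noise_multiplier: ratio of the Gaussian noise standard deviation to clip_norm.
    rng:              a random.Random instance used to draw the noise.
    """
    dim = len(weights)
    clipped = [clip_gradient(g, clip_norm) for g in per_example_grads]
    summed = [sum(g[i] for g in clipped) for i in range(dim)]
    # Noise is calibrated to the clipping bound, which caps each example's contribution.
    sigma = noise_multiplier * clip_norm
    noisy = [s + rng.gauss(0.0, sigma) for s in summed]
    batch_size = len(per_example_grads)
    return [w - lr * n / batch_size for w, n in zip(weights, noisy)]
```

Clipping bounds how much any single training example can move the model, and the added noise masks what remains. In a multi-party setting both the norm computation and the noise sampling would run on secret-shared values, which is where the communication overhead discussed in the abstract arises.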


A Semantic-aware Attention and Visual Shielding Network for Cloth-changing Person Re-identification

Zan Gao, Hongwei Wei, Weili Guan, Jie Nie, Meng Wang, Shenyong Chen

Category: Computer Vision

2022-07-18

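Cloth-changing person re-identification (ReID) is a newly emerging research topic that aims to retrieve pedestrians who have changed their clothes. Because human appearance varies greatly with different clothes, it is very difficult for existing approaches to extract discriminative and robust feature representations. Current works mainly focus on body shape or contour sketches, but human semantic information and the potential consistency of pedestrian features before and after changing clothes are not fully explored or are ignored. To address these issues, this work proposes a novel semantic-aware attention and visual shielding network for cloth-changing person ReID (abbreviated as SAV), whose key idea is to shield clues related to the appearance of clothes and to focus only on visual semantic information that is insensitive to view and posture changes. Specifically, a visual semantic encoder is first employed to locate the human body and clothing regions based on human semantic segmentation information. Then, a human semantic attention module (HSA) is proposed to highlight the human semantic information and reweight the visual feature map. In addition, a visual clothes shielding module (VCS) is designed to extract a more robust feature representation by covering the clothing regions and focusing the model on visual semantic information unrelated to the clothes. Most importantly, these two modules are jointly explored in an end-to-end unified framework. Extensive experiments demonstrate that the proposed method significantly outperforms state-of-the-art methods and extracts more robust features for cloth-changing persons. Compared with FSAM (published in CVPR 2021), this method achieves improvements of 32.7% (16.5%) and 14.9% (-) in terms of mAP (rank-1) on the LTCC and PRCC datasets, respectively.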
